An Annotated Corpus In Japanese Using Tesniere's Structural Syntax
نویسندگان
چکیده
substmlces processes substantive verbal O ! adjectival adverbial A E l Except when mentioned, examples axe from tile treebaak.
منابع مشابه
Deeper syntax for better semantic parsing
Syntax plays an important role in the task of predicting the semantic structure of a sentence. But syntactic phenomena such as alternations, control and raising tend to obfuscate the relation between syntax and semantics. In this paper we predict the semantic structure of a sentence using a deeper syntax than what is usually done. This deep syntactic representation abstracts away from purely sy...
متن کاملThe Hinoki Sensebank - A Large-Scale Word Sense Tagged Corpus Of Japanese
While there has been considerable research on both structural annotation (such as the Penn Treebank (Taylor et al., 2003) or the Kyoto Corpus (Kurohashi and Nagao, 2003)) and semantic annotation (e.g. Senseval: Kilgariff and Rosenzweig, 2000; Shirai, 2002), there are almost no corpora that combine both. This makes it difficult to carry out research on the interaction between syntax and semantic...
متن کاملEvaluation of a Japanese CFG Derived from a Syntactically Annotated Corpus with Respect to Dependency Measures
Parsing is one of the important processes for natural language processing and, in general, a large-scale CFG is used to parse a wide variety of sentences. For many languages, a CFG is derived from a large-scale syntactically annotated corpus, and many parsing algorithms using CFGs have been proposed. However, we could not apply them to Japanese since a Japanese syntactically annotated corpus ha...
متن کاملUsing a Partially Annotated Corpus to Build a Dependency Parser for Japanese
We explore the use of a partially annotated corpus to build a dependency parser for Japanese. We examine two types of partially annotated corpora. It is found that a parser trained with a corpus that does not have any grammatical tags for words can demonstrate an accuracy of 87.38%, which is comparable to the current state-of-the-art accuracy on the Kyoto University Corpus. In contrast, a parse...
متن کاملJapanese Named Entity Recognition Using Structural Natural Language Processing
This paper presents an approach that uses structural information for Japanese named entity recognition (NER). Our NER system is based on Support Vector Machine (SVM), and utilizes four types of structural information: cache features, coreference relations, syntactic features and caseframe features, which are obtained from structural analyses. We evaluated our approach on CRL NE data and obtaine...
متن کامل